AARI: automatic arabic readability index

نویسندگان

  • Abdel Karim Al Tamimi
  • Manar Jaradat
  • Nuha Al-Jarrah
  • Sahar Ghanem
چکیده

Text readability refers to the ability of the reader to understand and comprehend a given text. In this research, we present our approach to develop an automatic readability index for the Arabic language: Automatic Arabic Readability Index (AARI), using factor analysis. Our results are based on more than 1196 Arabic texts extracted from the Jordanian curriculum in the subjects of: Arabic language, Islamic religion, natural sciences, and national and social education for the elementary classes (first grade through tenth grade). We conduct a comparison study to support our model using cluster analysis and Support Vector Machines (SVM). In order to facilitate the usage of our Arabic readability index, we developed two applications to compute the Arabic text readability: A standalone application and an add-on for Microsoft Word text processer. Through our presented research results and developed tools, we aim to improve the overall readability of Arabic texts, especially those targeted towards the younger generations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...

متن کامل

Readability index as a design criterion for elicited imitation tasks in automatic oral proficiency assessment

We investigate the effectiveness of using an accepted readability index, the Flesch Reading Ease (FRE) scale, to design prompts for an automatic oral proficiency assessment system. The prompts in question are uttered by the system, and must be repeated from memory by the test subjects, in the form of an elicited imitation exercise. The FRE scores for our prompts are shown to correlate well with...

متن کامل

Automatic summarization as means of simplifying texts, an evaluation for Swedish

We have developed an extraction based summarizer based on a word space model and PageRank and compared the readability of the resulting summaries with the original text, using various measures for Swedish and texts from different genres. The measures include among others readability index (LIX), nominal ratio (NR) and word variation index (OVIX). The measures correspond to the vocabulary load, ...

متن کامل

OSMAN ― A Novel Arabic Readability Metric

We present OSMAN (Open Source Metric for Measuring Arabic Narratives) a novel open source Arabic readability metric and tool. It allows researchers to calculate readability for Arabic text with and without diacritics. OSMAN is a modified version of the conventional readability formulas such as Flesch and Fog. In our work we introduce a novel approach towards counting short, long and stress syll...

متن کامل

Abstracts/Journal of the Arabic Language and LiteratureVol.14, No48, autumn 2018

Contents The Representation of Culture in Arabic pedagogy books to non-Arabic languages Danesh Mohammadi, Sakineh Zarenejad....................................................... 1 Critical Study of the‏‏manifestations of Mamluke's life from the novel “Alsaeroun‏‏niyam...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Int. Arab J. Inf. Technol.

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2014